Derivative-free reinforcement learning: a review
نویسندگان
چکیده
Reinforcement learning is about agent models that make the best sequential decisions in unknown environments. In an environment, needs to explore environment while exploiting collected information, which usually forms a sophisticated problem solve. Derivative-free optimization, meanwhile, capable of solving problems. It commonly uses sampling-and-updating framework iteratively improve solution, where exploration and exploitation are also needed be well balanced. Therefore, derivative-free optimization deals with similar core issue as reinforcement learning, has been introduced approaches, under names classifier systems neuroevolution/evolutionary learning. Although such methods have developed for decades, recently, exhibits attracting increasing attention. However, recent survey on this topic still lacking. article, we summarize date, organize aspects including parameter updating, model selection, exploration, parallel/distributed methods. Moreover, discuss some current limitations possible future directions, hoping article could bring more attentions serve catalyst developing novel efficient approaches.
منابع مشابه
Reinforcement Learning: Model-free
Simply put, reinforcement learning (RL) is a term used to indicate a large family of dierent algorithms RL that all share two key properties. First, the objective of RL is to learn appropriate behavior through trialand-error experience in a task. Second, in RL, the feedback available to the learning agent is restricted to a reward signal that indicates how well the agent is behaving, but does ...
متن کاملA Review of Reinforcement Learning
Monte Carlo methods, and temporal difference learning are teased apart, then tied back together in a unified way. Innovations such as backup diagrams, which decorate the book cover, help convey the power and excitement behind reinforcement learning methods to both novices and veterans like us. The book consists of three parts, one dedicated to the problem description and two others to a range o...
متن کاملMultitask model-free reinforcement learning
Conventional model-free reinforcement learning algorithms are limited to performing only one task, such as navigating to a single goal location in a maze, or reaching one goal state in the Tower of Hanoi block manipulation problem. It has been thought that only model-based algorithms could perform goal-directed actions, optimally adapting to new reward structures in the environment. In this wor...
متن کاملReinforcement Learning in Neural Networks: A Survey
In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...
متن کاملReinforcement Learning in Neural Networks: A Survey
In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Frontiers of Computer Science
سال: 2021
ISSN: ['1673-7350', '1673-7466']
DOI: https://doi.org/10.1007/s11704-020-0241-4